Binaural Cue Coding: Rendering of Sources Mixed into a Mono Signal

Author

  • Christof Faller
Abstract

This paper reviews Binaural Cue Coding (BCC). BCC is a lossy technique for reducing either a number of source signals or a multichannel audio signal to one audio channel plus side information. When a number of source signals (e.g. separately recorded instruments) are reduced to one audio channel plus side information, the BCC synthesis allows rendering of each source as if the separate source signals were given. The same side information supports different playback setups (e.g. binaural headphone playback, stereo loudspeaker playback, multi-channel loudspeaker playback). When a stereo or multi-channel audio signal is reduced to one audio channel plus side information, the BCC synthesis renders a stereo or multi-channel audio signal with a spatial image similar to that of the original signal. The BCC representation of audio signals requires only the bitrate for one audio channel, which can be coded with conventional audio or speech coders, plus a few kb/s for the BCC side information. Existing mono broadcasting or communications systems can be upgraded with BCC for spatial audio by transmitting the BCC side information in addition to the existing mono audio transmission.
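The stereo case described in the abstract can be illustrated with a minimal sketch. It is not the paper's scheme. It assumes non-overlapping rectangular frames, uniformly spaced frequency bands, and a single cue, a per-band level difference between channels in dB. All function names and parameters (`bcc_analyze`, `bcc_synthesize`, `frame`, `n_bands`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def _frames_fft(x, frame):
    # Split into non-overlapping frames and transform each (simplifying assumption).
    n = len(x) // frame
    return np.fft.rfft(np.asarray(x, dtype=float)[: n * frame].reshape(n, frame), axis=1)

def _frames_ifft(X, frame):
    # Inverse of _frames_fft: back to one time-domain signal.
    return np.fft.irfft(X, n=frame, axis=1).reshape(-1)

def _band_edges(n_bins, n_bands):
    # Uniform band partition (assumption; perceptual bands would be non-uniform).
    return np.linspace(0, n_bins, n_bands + 1).astype(int)

def bcc_analyze(left, right, frame=256, n_bands=8):
    """Reduce stereo to one mono channel plus per-band level differences (dB)."""
    L, R = _frames_fft(left, frame), _frames_fft(right, frame)
    mono = _frames_ifft(0.5 * (L + R), frame)  # simple sum downmix
    edges = _band_edges(L.shape[1], n_bands)
    eps = 1e-12  # avoids log of zero in silent bands
    level_diff_db = np.empty((L.shape[0], n_bands))
    for b in range(n_bands):
        lo, hi = edges[b], edges[b + 1]
        p_left = np.sum(np.abs(L[:, lo:hi]) ** 2, axis=1)
        p_right = np.sum(np.abs(R[:, lo:hi]) ** 2, axis=1)
        level_diff_db[:, b] = 10.0 * np.log10((p_right + eps) / (p_left + eps))
    return mono, level_diff_db

def bcc_synthesize(mono, level_diff_db, frame=256):
    """Render stereo from the mono channel by re-imposing the level differences."""
    M = _frames_fft(mono, frame)
    n_bands = level_diff_db.shape[1]
    edges = _band_edges(M.shape[1], n_bands)
    L, R = np.zeros_like(M), np.zeros_like(M)
    for b in range(n_bands):
        lo, hi = edges[b], edges[b + 1]
        ratio = 10.0 ** (level_diff_db[:, b] / 10.0)  # right/left power ratio
        # Gains chosen so that g_right^2 / g_left^2 = ratio and g_left^2 + g_right^2 = 2.
        g_left = np.sqrt(2.0 / (1.0 + ratio))
        g_right = np.sqrt(2.0 * ratio / (1.0 + ratio))
        L[:, lo:hi] = M[:, lo:hi] * g_left[:, None]
        R[:, lo:hi] = M[:, lo:hi] * g_right[:, None]
    return _frames_ifft(L, frame), _frames_ifft(R, frame)
```

For a source panned so that the right channel is half the amplitude of the left, `bcc_analyze` should report roughly -6 dB in every band, and `bcc_synthesize` should restore that level ratio from the mono signal alone. The side information here is only `n_bands` numbers per frame, which is the reason it can be much cheaper to transmit than a second audio channel.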


Similar resources

Binaural cue coding-Part I: psychoacoustic fundamentals and design principles

Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and BCC side information. The BCC side information has a low data rate and it is derived from the multichannel encoder input signal. A natural application of BCC is multichannel audio data rate reduction since only a single down-mixed audio channel needs to be transmitted. An alternati...


Binaural cue coding-Part II: Schemes and applications

Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and side information. The companion paper (Part I) covers the psychoacoustic fundamentals of this method and outlines principles for the design of BCC schemes. The BCC analysis and synthesis methods of Part I are motivated and presented in the framework of stereophonic audio coding. Th...


Parametric Coding of Stereo Audio Based on Principal Component Analysis

Low bit rate parametric coding of multichannel audio is mainly based on Binaural Cue Coding (BCC). Another multichannel audio processing method called upmix can also be used to deliver multichannel audio, typically 5.1 signals, at low data rates. More precisely, we focus on an existing upmix method based on Principal Component Analysis (PCA). This PCA-based upmix method aims at blindly creating a re...


Härmä and Faller Spatial Decomposition

Techniques where a stereo or a multichannel signal is decomposed into spatial source-labeled time-frequency slots by level, time-difference, and coherence metrics have become popular in recent years. Good examples are binaural cue coding and up/downmixing techniques. In the article, we will provide an overview and discuss parallel approaches in the field of array processing and blind source sep...


Spatial Hearing Algorithms Based on Binaural Zero-Crossings: Sound Source Localization, Segregation, and Dereverberation

This thesis concerns a new zero-crossing-based binaural model for spatial hearing. The conventional binaural model computes cross-correlations of binaural signals to estimate the interaural time difference, which is a primary spatial cue. However, the cross-correlation-based binaural processing model requires high computational complexity and suffers from inaccuracies in localizing sound so...



Publication date: 2004